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Abstract — In this worlfl we consider the joint precoding across 
K transmitters (TXs), sharing the knowledge of the user's data 
symbols to be transmitted towards K single-antenna receivers 
(RXs). We consider a distributed channel state information 
(DCSI) configuration where each TX has its own local estimate 
of the overall multiuser MIMO channel. The focus of this work 
is on the optimization of the allocation of the CSI feedback 
subject to a constraint on the total sharing through the backhaul 
network. Building upon the Wyner model, we derive a new 
approach to allocate the CSI feedback while making efficient 
use of the pathloss structure to reduce the amount of feedback 
necessary. We show that the proposed CSI allocation achieves 
good performance with only a number of CSI bits per TX which 
does not scale with the number of cooperating TXs, thus making 
the joint transmission from a large number of TXs more practical 
than previously thought. Indeed, the proposed CSI allocation 
reduces the cooperation to a local scale, which allows also for 
a reduced allocation of the user's data symbols. We further 
show that the approach can be extended to a more general class 
of channel: the exponentially decaying channels, which model 
accuratly the cooperation of TXs located on a one dimensional 
space. Finally, we verify by simulations that the proposed CSI 
allocation leads to very little performance losses. 

I. Introduction 

Network or Multicell MIMO methods, whereby multiple 
interfering transmitters (TXs) share user messages and al- 
low for joint precoding, are currently considered for next 
generation wireless networks With perfect message and 
channel state information (CSI) sharing, the different TXs can 
be seen as a unique virtual multiple-antenna array serving 
all receivers (RXs), in a multiple-antenna broadcast channel 
(BC) fashion. However, the allocation of the data symbols and 
the CSI to the cooperating TXs impose huge requirements 
on the architecture. The common solution is to use disjoint 
clusters to reduce the amount of data to be shared f2l, ||3|. Yet, 
clustering limits the performance of the system because of the 
interference created at the edge of the clusters. In this work 
work, we focus on the limited sharing of the CSI feedback 
bits while the sharing of the data symbol is considered in the 
parallel work l?)- We will show later that our approach also 
leads to a reduction of the user's data sharing needed. 

In recent works, adaptive allocation of the CSI feedback 
bits in a multicell cellular network has been studied ||5l- 
|[8l . However, we consider here the joint precoding from 

' This work has been performed in the framework of the European research 
project ARTIST4G, which is partly funded by the European Union under its 
FP7 ICT Objective 1.1 - The Network of the Future. 



TXs having their own local channel estimates. This setting, 
introduced in f9l-|Tr| as the distributed CSI (DCSI)-MIMO 
channel, opens up completely new research problems. In that 
case, the optimization of the CSI allocation has been discussed 
in im but only in terms of degrees of freedom. 

It is very intuitive that, in wireless networks, the precision 
with which a channel to a given RX should be known at a 
given TX depends on the distance between the TX and the 
RX. However, an analytical analysis has never been done and 
is the focus of this work. Particularly, we aim at answering 
the fundamental question: How should the CSI feedback be 
shared through the network? 

We start by analyzing a very simplified channel model, 
referred to in the literature as the Wyner model, in which 
the TXs and the RXs are placed on a one-dimensional space 
(e.g. a line) and receive signals only from a few neighboring 
TXs. This simplistic model has the advantage of being more 
tractable while still offering valuable insights on more realistic 
channels. This model has been introduced in lfT2l and has 
been very successful since, particularly to model cooperation 
in wireless networks [|T3]| - lfT6ll . The extension of the work to 
more general channel models is provided in ifTTl . 

In this work, building above an asymptotic analysis tak- 
ing into account the geometry of the network, we derive 
analytically for the Wyner model a CSI allocation allowing 
for a large reduction of the CSI required at the TXs at 
the cost of reduced performance losses. Moreover, our CSI 
allocation scales linearly in the number of TXs instead of the 
conventional quadratic scaling for full CSI sharing, while still 
performing well, and is thus an interesting solution to make 
large cooperations areas more feasible in practice. 

II. System Model 

We consider the distributed CSI (DCSI)-MIMO channel, 
in which K transmitters (TXs) transmit jointly using linear 
precoding to K receivers (RXs) equipped with a single an- 
tenna and applying single user decoding. Each TX has the 
knowledge of the K symbols to transmit to the K RXs due to 
a pre-existing routing protocol for user-plane data. Besides the 
knowledge of data symbol, each TX is supposed to acquire, 
through an unspecified feedback or sharing mechanism, its 
own estimate on the channel vectors to all the users, thus 
explaining the meaning of the term distributed. 



The channel is represented by the channel matrix H e 
^KxK ^jj^ jjjg transmission is described mathematically as 
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where i/i is the signal received at the i-th RX, hf £ C^^^ the 
channel to the i-th RX, r] = [771, ... , tik]^ S C^^^ the zero 
mean unit variance i.i.d. complex Gaussian noise (CAf{Q, 1)) 



B. Optimized CSI Allocation 

Depending on the CSI accuracy at a TX, the coefficients 
implemented will, or will not, be very different from the 
coefficients obtained with perfect CSI. Considering a realistic 
constraint on the total number of CSI bits transmitted via the 
multiuser feedback channel, which we denote by i?max, we 
want to find the allocation of the feedback bits maximizing 
the average sum rate. Thus, the optimization problem is then: 
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. variance 1.1. a. complex uaussian noise \y), i))- . . ^ ^ ^(j) ^ 

^ is the transmitted signal obtained from the symbol /r(3)i, ^ * ~ 

= [si,...,s/f]TeC^xi (i.i.d. CAA(0,1)) as '=^'=^ 



(7) 



SK 



(2) 



where T e C^^^^^ is the precoding matrix and ti e C^^^ 
is the beamforming vector used to transmit s; to RX i. We 
consider a per-stream power constraint ||ti|p = P. 

Our focus is on the maximization of the sum rate averaged 
over the fading distribution, where the rate of user i reads as 
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In this work, we study the high SNR regime such that it 
is interesting to introduce the Multiplexing Gain (also called 
degree of freedom or prelog factor) at RX i as 
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A. Distributed CSI and Distributed Precoding 

In the DCSI-MIMO channel, the «-th TX has its own 
individual estimate of its channel hf to RX i for all i, denoted 
by h[-'^^ and obtained from a Random Vector Quantization 
(RVQ) using bits. We focus on interference limited 

wireless networks working at high SNR so that we assume 
that Zero Forcing (ZF) precoders are used since they converge 
to the optimal precoder as the SNR increases. We denote by 
t] the beamforming vector transmitting symbol Si computed 
at TX j and which reads as 
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Although a given TX j may compute the whole precoding 
matrix T^^^ only the j-th row will be used in practice. Finally, 
the effective precoder is given by 
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More detail on precoding for the DCSI-MIMO channel can be 
found in Bill. 



However, this problem is intricate as the dependency of the 
sum rate as a function of the CSI estimate error does not have 
an easy structure. Based on the insight from the asymptotic 
analysis in the DCSI-MIMO channel ifTTI . we approximate 
this optimization problem by the following one: 

K K K 

minimize ^E[p,-tPCSi II ^^^^ EE^P^ ^ 

{^i } 1=1 j=i i=i 

(8) 

where t^'~^^ is the beamforming vector transmitting symbol Si 
using perfect CSI and is given by 
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The intuition behind this figure of merit follows from the fact 
that, in the DCSI-MIMO channel, the consistency between the 
precoders implemented at the TXs has a critical impact on the 
performance. 

It is widely believed that the accuracy of the CSI at a TX 
should somehow be a function of how much interference it 
actually contributes to each RX. However, despite the intuition, 
no previous paper has identified how the CSI quantization bits 
should be allocated throughout the network. Building upon 
some simplified models, we will show in the following that it 
is possible to obtain some quantitative results which can then 
be used as guidelines for practical systems. 

III. Wyner Model 

We consider a finite linear version of the Wyner model 
where K TXs equipped with only one antenna are uniformly 
distributed along a line. A RX receives interference coming 
from its two direct neighboring TXs with an inter-cell atten- 
uation factor equal to fi £ (0, 1), while the short term fading 
is assumed to be Rayleigh distributed. Thus, the multiuser 
channel matrix H G £,kxk ^ tridiagonal matrix defined as 
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where bi, di, and Ui are distributed as standard i.i.d. complex 
Gaussian (CJ\f{0, 1)). We denote the estimates of bi, di, and a; 
at TX j by b'f \ d'f \ and a'f \ respectively, while the estimate 
errors are then A6p\ ^d'f\ and Aap"*, respectively. 

A. Inverse of a Tridiagonal Matrix 

The Wyner model allows to obtain a closed form for the 
channel inverse |18|, which we recall in the following and 
will be useful to quantify the effect of the distributed CSI. 

We start by introducing two K + 1 dimensional vectors 
(3 = [Pi,. . . , fHx+i] and a — [ao, ■ ■ ■ , ax] which will be 
used in the matrix inverse and are defined as 
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1 , if i=K+l 

P^^{dK ,ifi=K . (11) 

^dil3i+i - p?bi+iaiai+2 , if 1=1,- • ■ ,K-1 

The channel inverse H^^ then reads as follows. First, the 
diagonal elements for i = 2, . . . , — 1 are 



^'^bi+iail3i+2 ii'^biai-iai-2 



while the two extremal diagonal elements are 
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Last, the off-diagonal elements are given by 



(14) 

B. CSI Allocation in the Wyner Model 

The optimization of the CSI allocation in the DCSI-MIMO 
channel is still involved, even in the Wyner model, as it 
requires evaluating the impact of the CSI estimate errors in 
the coefficients computed at each TX based on its own local 
CSI. Thus, we consider instead an approximation and study the 
CSI allocation at high SNR. We start by recalling the following 
asymptotic result in the DCSI-MIMO channel. 

Theorem 1. i l77l/ To achieve the maximal MG in the DCSI- 
MIMO Rayleigh fading channel from K TXs to K RXs, it is 
necessary and sufficient to fulfill 

Vi e {1, . . . , K}, lim E\\\U - = 0(1) (15) 
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which is achieved with ZF using RVQ if and only if 
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Note that the K — 1 in ( fTSI l is replaced here by a 2 as it 
corresponds to the non-zeros elements in the channel vector. 

The optimal CSI allocation can be seen to converge to 
the uniform CSI allocation as the SNR increases. However, 
a uniform allocation is clearly very suboptimal at finite SNR, 
particularly in a large network with pathloss, where a distant 
TX emits very little interference. Therefore, the geometry of 
the network has to be used in order to derive an efficient CSI 
allocation at finite SNR. 

We can observe in the expressions given in Subsection llll-AI 
for the inverse of a tridiagonal channel that the amplitude 
of the coefficients in the inverse decreases exponentially as 
the elements get away from the diagonal. This is actually a 
consequence of the Wyner model which models the pathloss 
attenuation by considering interference only from direct neigh- 
bors. In the following, we make use of this property to derive 
an efficient CSI allocation. 

To derive analytical results at high SNR, we consider that 
the inter-cell coefficient /i can be written as a fixed fraction 
of P, i.e., P^"^ for a certain e e (0, 1). Thus, the inter- 
cell coefficient decreases as the power increases. Note that 
this is only an artefact to model the impact of the inter-cell 
attenuation in the high SNR regime and that we show by 
simulations that the performance remain good as fj, increases. 

Theorem 2. Considering that ji = P^^ for a fixed e £ (0, 1) 
and that random vector quantization (RVQ) is used, it is 
sufficient in order to achieve the maximal MG to quantize 
the channel vectors hJ^ at TX j with a number of bits 
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Proof: We start by inserting the CSI error estimates in the 
off-diagonal elements of the inverse given in Subsection llll-AI 
We consider the coefficient with j < i as the case j > i 
follows by symmetry. In a first step we consider the coefficient 
at TX j corresponding to the transmission of stream i. 
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where we have considered that the estimation errors are small 
as the CSI allocation will be derived so as to fulfill ( fTSl l. so 
that we can do a first order approximation in the estimation 
error terms. To proceed, we approximate the first ratio in ( fTSl l 
by keeping only the first order coefficients in ji which gives 
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To obtain an approximate closed form for the channel inverse, 
we also need to approximate the diagonal term defined in (fTZt 
in the denominator of the Left Hand Side (LHS) in ( fTsT l which 
is done as 
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To fulfill condition ( fTSl ). the sum of all the error terms 
arising in ( fTSl ) has to tend to zero as 0(1/ P) due to the fact 
that the channel inverse elements given in ( fTSI l are further 
normalized so as to fulfilled the per-stream power constraint 
of P. 

Inserting ( fT9] l and (l2Tl i in (fTSl l we conclude by inspection 
that this condition is fulfilled on the first order when a) and 



d^P for ^ e {i — 1, i — 2, . . . , j} as well as d-"" are obtained 
from the quantization of the channel vector hg for I ^ {i ~ 
l,i-2,.. with max(21og2(PA*^('"-'^),0) bits. 

This conditions corresponds to the transmission of symbol 
Si at TX j and is a consequence that the average power used 
for that coefficient is 0(P^^('^^^) instead of 0{P) such that 
the accuracy of the CSI can be reduced. 

These requirements on the CSI quantization correspond 
to the strongest requirement for that stream at that TX, 
but the other channel parameters need obviously to be also 
known, although with a lower accuracy. Particularly, no CSI 
requirement has been made on the by It is thus necessary to 
improve the accuracy of the approximation done in (fTSl l and 
(fT9] l. Using the definition of ai, we can write 
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after keeping this time the first order term. The ratio of the two 
aP can then be approximated by the product of the dg using 
the zero-th order approximation as in iT% . Inserting this ex- 
pression in ( fTsl l. we obtain the additional conditions that b^ , 
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bits, from which only the conditions on b^ and dlf^i are new. 

Proceeding similarly for increasing number of terms kept, 
we increase the accuracy of the approximation made in d^Tt 
to the first for order to obtain the condition that b/^^, al , 
and a[^\ should be quantized from a RVQ with at 



least max(2 log2(P/i' 
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) , 0) bits, from which only the 



Increasing further the accuracy of the approximation leads 
to a requirement for each term. However, for a given TX, 
this has to be done for all the streams to be transmitted. Thus, 
only the strongest requirements are of real interest in our case. 
Considering only the three constraints derived previously to 
have dj_-^ and dj from a RVQ with max(21og2(P^^''*^^'), 0) 
bits, and b^^^ from a RVQ with max(21og2(PA'^(*"^+^'), 0) 
bits, we can see that considering only these three constraint 
for all the streams transmitted leads to the most constraining 
requirements. 

Additionnaly, as the elements from the vector are quantized 
jointly, we obtain the final requirement for the channel inverse. 
The last point to discuss comes from the normalization fol- 
lowing the power constraint fullfilment. Indeed, each vector 
of the channel inverse if divided by its norm and multiplied 
by a/P- On the zero-th order, the norm of the channel inverse 
beamformer is equal to 1/di, which has then to be quantized 
with max(2 log2(P/i^*-*~-'^), 0) bits. This is already the case 
which conclude the proof. ■ 

Remark 1: In fact, the difference of required accuracy 
between dj , and the off-diagonal elements Uj and by 
comes only from the fact that the attenuation factor /i is known 
such that the outer-diagonal elements can be factorized into 
the attenuation factor /i and the Rayleigh fading part. Thus, 
the estimation error is directly attenuated by the inter-cell 
attenuation factor fi. 

yet, this result should be interpreted in the right way which 
is that the estimation error for the three coefficients (xaP , 
(1) (i) 

dy and /i^j should have the same variance. Another way to 
understand it, is that the variance of the interfering coefficient 
is smaller so that the number of bits necessary to quantize 
it with the same accuracy is smaller Using random scalar 
quantization for each coefficient with the knowledge of the 
attenuation factor, the condition given in the proposition can 
be replaced by the condition 
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Remark 2: A per stream power constraint has been con- 
sidered instead of a constraint for the constraint over the 
whole precoder. An additional per antenna constraint could be 
further introduced but this would complicate the analysis. The 
property that has to be fulfilled to allow for the optimization 
with only local CSI is that the optimization is considered for 
each stream separately. Indeed, using a total power constraint 
would lead the TXs to normalize the precoder using the 
beamforming vectors corresponding to users very far away, 
i.e., using very inaccurate CSI. This is problematic in the 
DCSI-MIMO channel as this reduces the consistency between 
the TXs. 

Intuitively, the CSI allocation is based on the fundamental 
property that a given TX contributes a large power only to 
the streams transmitted to neighboring RXs. For the symbols 
transmitted to distant RXs, it contributes to the transmission 
of the stream with only a small power, so that a less accurate 



CSI is needed to maintain a constant interference level created 
by the transmission of that stream. 

The CSI allocation derived in Theorem |2] reduces signifi- 
cantly the requirements on the CSI. Particularly, we will show 
in the following that it leads to the transmission of the CSI only 
at a local scale, thus giving a CSI by TX whish does not scale 
with the number of TXs. In relation with this fundamental 
property, we call the CSI allocation given in Theorem |2] as 
the Decaying CSI allocation. 

C. CSI Scaling in the Number of Cooperating TXs 

In the following, we consider the total number of feedback 
bits shared through the backhaul network which is of interest 
to us, as it is a benchmark of the feasibility of the cooperation. 
Particularly, we are interested in the scaling behavior of the 
total number of bits for large number of cooperating TXs. In 
the full CSI allocation given in Theorem [T] it can be shown 
to be equal to: 
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where we have denoted by i?'^^ the total number of feedback 
bits allocated to TX j. The total number of feedback bits 5^"" 
increases quadratically with the number of cooperating TXs. 
This is problematic for large cooperation areas and is no longer 
the case in the proposed CSI allocation. 

Proposition 1. For finite P and p,, the CSI allocation given 
in Theorem\2\scales linearly in the number K of cooperating 
TXs, which means that the number of CSI bits required by 
TX/RX pairs does not increase with the number of cooperating 
TX K. 

Proof: Let consider and P to be given. The attenuation 
factor /i takes its value in (0, 1), so that it exists a natural 
number Kq verifying /i^" < l/-\/P. We consider K to be 
very large (and odd to simplify the notation) such that we can 
consider wlog a TX placed in the middle of the cooperation 
domain. Using the CSI allocation from Theorem |2l 

K K 

(24) 

= 2^max(21og2(P/i2''),0). 

i' = l 

Letting K tend to infinity, the sum has only a finite number of 
positive terms as the logarithm will be negative for i' > Kq. 
Thus the B^^'> do not scale with K which ends the proof. ■ 
The insight behind the limited CSI allocation by TX is that 
the exponential decay of the off-diagonal coefficients in the 
precoding matrix is exploited to reduce the cooperation to a 
local scale. 

D. User's Data Symbol Allocation 

In this work, we have considered the user's data symbols 
to be shared between all the TXs. This is very constraining 
particularly when considering networks of large size. 



yet, our analysis for the CSI allocation has introduced a 
notion of local Cooperation which will use in the following 
to derive a routing solution for the user's data symbol which 
also requires only local Cooperation. 

Proposition 2. Considering that fj, = for a fixed e G 
(0, 1), sharing the symbol Si only to the TXs which verify 
21og2(P^^I*~^l) > leads to no loss in MG. 

Furthermore, for finite P and /i and increasing K, this 
user's data symbol sharing scales linearly in the number of 
TX/RX pairs, i.e., each symbol is shared only to a finite number 
of cooperating TXs. 

Proof: The proof follows easily from the proof of Theo- 
rem |2] because when 2 log2(P/^'^'*^"'') < 0, TX j is allocated 
with no CSI such that this would lead to a MG if the 
power used to transmit that symbol were positive. Thus, TX j 
does not participate to the transmission of symbol Si, which 
therefore doesn't need to be shared to TX j. 

The finite number of cooperating TXs follows then directly 
from the same property for the CSI allocation given in 
Propositions [T] ■ 

The property of sharing the symbols only to a finite number 
of TXs is interesting as combined with the local CSI alloca- 
tion, it gives a complete alternative to clustering algorithms. 
Both solutions have similar requirements for the backhaul and 
can be fairly compared. 

Thus, our alternative clustering solution based on the De- 
caying CSI allocation could then be implemented in realistic 
settings. In fact, we will see in the following sections, that the 
results can be generalized to more realistic settings. 

E. Extension to More General Band Limited Channel Models 

The analysis was carried out in a symmetric and regular 
linear Wyner model but can be extended without difficulties 
to aribtrary banded matrices using bounds on the amplitude 
of the off-diagonal coefficients fTol, which we recall in the 
following. 

Theorem 3 (Demkol984). Let A be an operator on a linear 
space, m-banded, centered, bounded and boundedly invertible, 
then 

{A-i},,, < CAI*-^I (25) 



where 
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This bound can be used to determine a reduced CSI allo- 
cation similarly to previous done. Due to the dependency in 
the condition number, the bound has to be recalculated for 
each channel realization but this would also be the case in 
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Fig. 1. Average rate per user in terms of tlie normalized ti'ansmit power per 
TX for an inter-cell attenuation factor /i = 0.5. 

the previous analysis if no asumption was being made on the 
value of the attenuation factor /i. 

The exponential decay of the amplitude of the channel 
inverse is proven in Theorem [3] for arbitrary matrices. This 
property is the basis for our approach such that the decaying 
CSI allocation can be extended to banded matrices. For 
brievity concerns, this is not further discussed. 

F. Simulations for the Wyner Model 

We simulate the average sum rate in the Wyner model 
presented at the beginning of Section for a network made 
of 25 cooperating TXs applying distributively a ZF precoder 
as given in For comparison, we show the performance 
achieved when perfect CSI is available at all TXs. To evalu- 
ate the efficiency of the Decaying CSI allocation proposed 
in Theorem |2] we also compare it to two alternative CSI 
allocations using the same total number of feedback bits: the 
uniform CSI allocation and the overlapping cluster allocation. 
Clearly, the first one consists in allocating the bits uniformly. 
The second one consists in an improved clustering solution 
where TX j has the knowledge of the channel vector hf for 
i G {j — nduster, ■ ■ ■ ,j + "-cluster} with the number of bits uni- 
formly allocated to all TXs and channels. In the simulations, 
we consider nduster = 2, such that the number of cooperating 
TX is 5. This allocation has been used to improve on the 
usual clustering and is itself already a novel CSI allocation. 
Note that this allocation will be seen to perform quite badly 
in the Wyner model, but to have very good performance in 
the more realistic model discussed in Section |IV] 

In Fig. ??, we plot the average rate per user as a function 
of the normalized transmit power for /i = 0.5. The Decaying 
CSI allocation outperforms clearly the overlapping cluster 
solution. In fact, the decaying allocation could be shown to 
perform better than the overlapping cluster allocations for all 
the size of the clusters and is somehow approximated by an 
overlapping cluster allocation with the size changing in terms 
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Fig. 2. Average rate per user in terms of the inter-cell attenuation factor /i 
for a normalized transmit power per TX equal to 20 dB. 

of the SNR. 

The uniform CSI allocation becomes efficient only at very 
high SNR, when the pathloss does not any longer have a 
strong impact compared to the SNR, while the decaying CSI 
allocation performs well at all SNRs. The percentage of CSI 
used in the decaying CSI allocation compared to the full CSI 
allocation in iT5[ increases as the SNR increases. Indeed, 
the interceU attenuation factor /i is fixed, such that the CSI 
allocation converges to the uniform CSI allocation at high 
SNR. For a transmit power per TX of 20 dB with /i = 0.5 
it is equal to 0.43 and to 0.63 for a transmit power of 45 
dB. Furthermore, the percentage of CSI allocated also clearly 
decreases when the value of fi decreases such that the solution 
proposes becomes even more interesting. 

In Fig. [21 we have plotted the average rate per user as 
a function of the inter-cell attenuation factor fi for a given 
normalized transmit power per TX equal to 20 dB. As the 
attenuation factor /i becomes larger, the difference between 
the perfect CSI configuration and the one with optimized CSI 
allocation increases but for all values of fi the optimized CSI 
allocation remains very performing compared to the other 
solutions. As expected, when /i = 1, there is no inter-cell 
attenuation and the optimized CSI allocation becomes equal 
to the uniform CSI allocation. 

IV. CSI Allocation in Exponentially Decaying 
Channels 

The Wyner model represents an interesting model for its 
simplicity but is clearly a simplistic modeling of the real 
transmission. One of the main limiting asumption is the limited 
local interaction between the TXs and we will now introduce 
a model which lifts this restriction. We consider thus a more 
realistic channel model where all the links are non-zeros but 
the channel matrix has an exponentially decaying structure. 
This means that the amplitude of the {i,j)-th element of the 
channel matrix decreases in an exponentially fashion as a 



function of the difference \i — j\. Intuitively, this means that 
the weight of a coefficient descreases as it goes away from 
the diagonal. 

In this section, we will consider for simplicity a regular 
model for the exponential decaying matrix. The channel matrix 
H is then made from the element-wise product of a standard 
Rayleigh fading channel matrix G e ^kxk ^j^^ ^ Toeplitz 
matrix generated from the vector [1, /i^, . . . , ji^^^]: 
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(27) 

Such a channel matrix corresponds to practical transmission 
scenarios, e.g., from the joint transmission from a set of TXs 
set regularly along a road. In a first part, we will start by giving 
some mathematical properties of the exponetially decaying and 
we will then show how these results can be used to extend the 
approach from the previous Section to that setting. 

A. Mathematical Prerequisite on Exponentially Decaying Ma- 
trices 

We start by defining rigorously an exponentially decaying 
matrix. 

Definition 1. An infinitely large matrix A is said to be 
exponentially decaying // it exists a 7 > such that 

V7' < 7, |{A}.,,| < c'{i)exp{-i\i - j\) (28) 

where c'(7') is some constant proper to the matrix A. In 
II2OI - II22II . an involved functional analysis is done to extend 
the particularly interesting work ll23l from which the relevant 
properties are recalled here. 

Proposition 3. [23] Let A be an invertible matrix exponen- 
tially decaying with a coefficient 7, then A^^ is exponentially 
decaying for a coefficient 7'. 

This proposition allows to obtain information on the struc- 
ture of the channel inverse, which will be our ZF precoder, 
based on the structure of the channel. It implies that the set 
of exponentially decaying matrices is closed under inversion. 
Note however, that the constant 7' is a priori different from 
7, but we will in the following assume the constrant 7' to 
be known. As this constant depends only on the pathloss 
attenuation, it appears a realistic assumption that a long term 
channel estimate is inverted so as to estimate the value of the 
decay rate for the inverse 7'. 

Proposition 4. [23] Let A and B be two exponentially 
decaying matrices for a coefficient 7 and let il be any set 
of indices. If 

|{A},,, - {B},j| < cexp(-7(d(i,f]) + d{j,n))) (29) 



where c is some arbitrary constant and the distance d[-,^) 
denotes the minimal distance to an element in the set fJ. It 
holds then 

|{A-iKj-{B-iK,| <c'exp(-7'(d(*,r!) + d(j- r!))) (30) 

where c' is another constant. This proposition which was 
called in ll23l the Windows Lemma can be in fact very 
intuitively (and linked to its name). Due to the exponentially 
decaying structure of the channel, perturbation of one element 
of the matrix to invert A has only an impact on the coefficients 
the inverse A^^ located around the element considered. This 
comes from the exponentially decaying structure and has for 
consequence that only local knowledge of the matrix A is 
needed to obtain an arbitrary good approximation. 

This property can be put in relation with another useful 
Theorem which states that an exponetially decaying matrix 
can be approximated arbiratry well by band Umited matrices 

El. 

We will now show how it is possible to use this property 
of local inversibility to derive a CSI alloction which is more 
stingy in how the CSI is allocated. Thus, the CSI allocation is 
reduced very strongly so as to fulfill only the most necessary 
needs. 

B. CSI Allocation in Exponentially Decaying Channels 

We consider now the exponentially decaying channel matrix 
defined in ( |27| |. We discuss the MG achieved in that case when 
the p tends to zero as P tends to infinity. As already explained 
for the Wyner model, this is only an artefact to model the 
pathloss attenuation and at finite SNR, both are finite and we 
will see by simulations that the approach is efficient in realistic 
conditions. 

Before stating the main result, we start by applying Proposi- 
tion[3]to the channel matrix H which is observed to decay with 
the rate 7 = — \og{p). Thus, we conclude from the proposition 
that the channel inverse H^^ is exponentially decaying with 
the decay rate 7'. By similarity to the structure of the channel 
matrix, we define p' — exp(— 7') G (0,/i]. 

Theorem 4. Considering that tends to zero as P tends to 
infinity, a sufficient CSI allocation to achieve the maximal MG 
using RVQ is to quantize the channel vectors hf at TX j with 
a number of bits equal to: 

B^^ = {K- l)max (log, (^)) ,o) . (31) 

If a uniform scalar quantization is used, and we define the 

a) (1) 

number of bits quantizing H^^ by B^j, , then it is sufficient to 
achieve the maximal MG to let 

Bli^ = --(l°g2 ((^ (^) /^^"^-l) ,0) . (32) 

Proof: We first notice that the channel matrix H defined 
in ( |27] ) is exponentially decaying with decay constant 7 = 



— log(/i). The proof will be based on the use of the "Windows 
Lemma" |4] with the two matrices H and H^^^. We first notice 
that it exists a natural number k' such that fi''' < 1/P with 
^' = exp(— 7') defined from 7' the decay constant for the 
channel inverse H^^. We define Kj^ as the smallest natural 
number verifying this property which means that 

-log(^) " 

\og{^l) 

Thus, we Kq fulfills the relation 



(33) 



log(/i) 



which we can rewrite as 
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(34) 



(35) 



We also define the set ^l as 



n = {{j+K+l. 1), . . . , {j+K+l,K), ij~K'o-£, 1), . . . , (j-Kl,- 

(36) 

In fact, the set O is simply made of all the rows such that 
there are at least Kq — 1 rows between them and the j-th row. 

Let consider now that the CSI is allocated so as to fulfill the 
condition on the channels in Proposition 2] with this choice of 
and the matrices H*^^^ and H. The difference between the 
j-th row of the inverse of H'^-') and the inverseH is known at 
TX j with an error of the order 0{fi'^°). Using (??) we can 
further bound it to get 



< }_ 
p 



(37) 



In fact, only the j-th row of the precoder is used at TX j such 
that after a stream by stream normalization, the condition given 
in (fTSl l is fulfilled and the maximum MG is achieved. 

Therefore, we need to prove that the conditions for the chan- 
nel matrices is fullfilled to conclude the proof. Let consider in 
a first step the requirement for the CSI on the channel vector 
hj at TX j. To fullfill the condition of the theorem, the error 
made on the estimation of the channel vector hi should not be 



larger than a 0{fi^^«). Using 
as 



we can then lower bound 



1 



To have the estimation smaller than the RHS, it is necesssary 
to have a number of bits equal to 



(38) 
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(39) 



when RVQ is used to encode hj. The expression in ( |39] l is 
exactly the formula given in the Theorem, and the expressions 
for the other channel vectors hi with £ ^ j follow exactly on 
the same way, only with Kq replaced by Kq — |£ — j|. This 
concludes the proof of (l3Tl l. 

The start of the proof of (|32] | is exactly the same. The 
only difference is that to obtain an estimation error of the 



order 0(/i2^o^'/(/i'^^o^'), less bits are necessart when using 



,'2K' 



scalar quantization with the knowledge of the variance of 
the coefficients. Let consider without loss of generality the 
quantization of hj. Then, Hji has a variance equal to /i^l'^^l. 
Thus, only log2(/^^ less bits are needed to obtain the 
same accuracy as for the direct element Hjj . Using this result 
in (EB gives ■ 

The CSI allocation ( [32] l is based on the same property of the 
precoder as in the Wyner model that the off-diagonal elements 
in the inverse are exponentially decaying. The modified CSI 
allocation (l32b makes use of the knowledge of the variance 
of the coefficients inside the channel vectors so that a scalar 
with a smaller variance is quantized with less bits. 

The constant /i' is not known beforehand but can realisti- 
cally be estimated based on long term information. Further- 
more, we have the strong belief that it is in fact equal to /i. 
The CSI allocation derived requires much less bits than the 
conventional one for the BC. Particularly, it also consists in 
limiting the cooperation to a local scale which leads to the very 
'l^(§dd*aHng of the total number of bits when the number of 
cooperating TXs increases. 

Proposition 5. Allocating the CSI according to (1311) leads 
to a quadratic scaling of the total number of feedback bits 
in K, while using the CSI allocation i32\ leads to a linear 
scaling, which means that the number of CSI bits by TX does 
not increases with K. 

Proof: The proof of the quadratic scaling of dsTl ) is 
exactly the same as for the Wyner model, and in fact is already 
proven in the proof of |4] since only the CSI of 2Kq users is 
shared with TX j. The proof of the linear scaling of (l32b 
follows the same pattern as uses the fact that only a finite 
number of the channel vector hi are quantized with a positive 
number of bits. ■ 
The restriction on the cooperation to a local scale is very 
interesting as it allows for the cooperation of very large num- 
ber of TX/RXs since there is no cost induced: the cooperation 
remains on the cooperation scale. 

C. Broadcasted CSI 

The broadcasting of the CSI by the RXs instead of trans- 
mitting it only to its own TX, has appeared recently as an 
interesting practical scheme to transmit the CSI to the different 
cooperating TXs as a reduced cost. A fundamental question 
has been to determine whether the Broadcasting of the CSI 
leads somehow to an optimal CSI allocation optimized by 
"nature". Indeed, the intuition is that a TX located far away 
receives an estimate with low accuracy but also interfers little 
so that it does not require a very good accuracy. From the 
analysis of the Wyner model, we know that this is not alway 
the case. Indeed, a given channel estimate should also be 
shared to TXs with which this user has no direct link. This is 
not anymore the case if you remove the intermediate TX/RX 
pairs. Indeed, the two TXs have an impact on each other 
because of the interference emitted to intermediate TXs. 

In the particular case considered, the "indirect links" are 
smaller than the direct links such that this leads to the property 
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Fig. 3. Average rate per user in terms of the inter-cell attenuation factor fi 
for a normalized transmit power per TX equal to 20 dB. 

that broadcasting the CSI leads to achieve close to the maximal 
MG as giving in Theorem |2l Indeed, it can be easily seen 
by inspection that the rate whish can be obtained through 
the interfering links correspond to the accuracy required by 
Theorem |2] This is not further investigated for the moment, 
but we will show that this is confirmed by the simulations. 

D. Simulations for the Exponentially Decaying Channels 

In this section, we provide simulations for an exponentially 
decaying channel defined according to dZTb . We consider a 
network made of X = 25 cooperating TXs and we consider 
the same CSI allocation solutions as for the simulations in 
the Wyner model in Section IIV-DI For the Decaying CSI 
allocation, we use the expression [32l and we also show an 
additional curve whish correponds to the broadcast of the CSI. 
Finally, we use fj.' — fj. as an approximation of the true value 

of /i'. 

In Fig. [3] we show the average rate per user achieved as a 
function of the SNR. We can observe how the conventional 
clustering perform badly, as does the uniform power allocation 
which would become efficient only at extremely high SNR. We 
can see how the decaying CSI allocation leads to practically no 
losse compared to the sum rate, while the broadcasting of the 
CSI also performs very well, note that due to the cooperation 
at only a local scale, only less than a percent of the total CSI 
is required by the decaying CSI allocation. 

In Fig. m the sum rate is shown in terms of the intercell 
attenuation coefficient. We can observe how the decaying CSI 
allocation performs as the ZF with perfect CSI while it is 
outperformed by the overlapping cluster allocation as the value 
of the coefficient increases. Finally, in Fig. |5j we show the 
logarithm of the amplitude of the precoding coefficient for a 
given stream normalized by the power used. We also show 
the logarithm of the amplitude of the channel coefficients 
with the indices in abscisse ordered so as to correspond to 
decreasing channel coefficients. We observe that the precoder 
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Fig. 4. Average rate per user in terms of the inter-cell attenuation factor /i 
for a normalized transmit power per TX equal to 20 dB. 
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Fig. 5. Average rate per user in terms of the inter-cell attenuation factor /i 
for a normalized transmit power per TX equal to 20 dB. 

coefficient are then also decreasing and that the decreasing is 
also exponential. 

V. Conclusion 

In this work, we have studied the optimization of the CSI 
allocation in the Distributed CSI-MIMO channel. For the 
Wyner model, we have derived a CSI allocation achieving 
good performance with a large reduction in the amount of CSI 
feedback needed. This is especially true for large networks as 
the scaling of the total number of CSI feedback bits is reduced 
so that the number of bits allocated to a TX does not increase 
with the number of cooperating TXs. This approach has then 
been extended to the conventional Rayleigh fading setting if 
the channel is exponentially decaying. By simulations, we 
have confirmed the strong potential of the proposed approach 
where the CSI allocation is strongly reduced (up to only 
a few percents of the original CSI allocation are needed) 



at the cost of reduced perfonnance losses. Studying further 
the application of this scheme to more practical transmission 
scheme is a very intereting direction of research. Particularly, 
we have shown that broadcast the CSI leeds to a achieve close 
to the CSI allocation derived. 
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